PROGRAMMABLE HORIZONTAL FILTER WITH NOISE REDUCTION 
AND IMAGE SCALING FOR VIDEO ENCODING SYSTEM 

Cross-Reference to Related Applications 

[0001] This application contains subject matter which is 
related to the subject matter of the following applications, 
each of which is assigned to the same assignee as this 
application and filed on the same day as this application. 
Each of the below-listed applications is hereby incorporated 
herein by reference in its entirety: 

[0002] "Programmable Vertical Filter for Video Encoding", 

by Ngai et al., U.S. Serial No. (Attorney Docket 

No. END920010082) ; and 

[0003] "Programmable And Adaptive Temporal Filter For 
Video Encoding", by Murdock et al., U.S. Serial No. 
(Attorney Docket No. END920010083 ) . 

Technical Field 

[0004] This invention relates in general to compression 
of digital visual images, and more particularly, to a 
technique for programmably, spatially horizontally filtering 
pixel values of a plurality of motion video frames, 
including selectively performing noise filtering and image 
scaling thereof using dynamically programmable filter 
coefficients . 
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Background of the Invention 



[0005] Technological advances in digital transmission 
networks, digital storage media, very large scale 
integration devices, and digital processing of video and 
audio signals have been converging to make the transmission 
and storage of digital video economical in a wide variety of 
applications. Because the storage and transmission of 
digital video signals is central to many applications, and 
because an uncompressed representation of a video signal 
requires a large amount of storage, the use of digital video 
compression techniques is vital to this advancing art. In 
this regard, several international standards for the 
compression of digital video signals have emerged over the 
past decade, with more currently under development. These 
standards apply to algorithms for the transmission and 
storage of compressed digital video in a variety of 
applications, including: video-telephony and 
teleconferencing; high quality digital television 
transmission on coaxial and fiberoptic networks, as well as 
broadcast terrestrially and other direct broadcast 
satellites; and in interactive multimedia products on CD- 
ROM, Digital Audio Tape, and Winchester disk drives. 

[0006] Several of these standards involve algorithms 
based on a common core of compression techniques, e.g., the 
CCITT (Consultative Committee on International Telegraphy 
and Telephony) Recommendation H.120, the CCITT 
Recommendation H.261, and the IS0/IEC MPEG-1 and MPEG- 2 
standards. The MPEG algorithms have been developed by the 
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Moving Picture Experts Group (MPEG), part of a joint 
technical committee of the International Standards 
Organization (ISO) and the International Electrotechnical 
Commission (IEC) . The MPEG committee has been developing 
standards for the multiplexed, compressed representation of 
video and associated audio signals. 

[0007] The MPEG-2 standard describes an encoding method 
that results in substantial bandwidth reduction by a 
subjective lossy compression followed by a lossless 
compression. The encoded, compressed digital data is 
subsequently decompressed and decoded in an MPEG-2 compliant 
decoder. The MPEG-2 standard specifies a very high 
compression technique that achieves compression not 
achievable with intraframe coding alone, while preserving 
the random access advantages of pure intraframe coding. The 
combination of frequency domain intraframe encoding and 
interpolative/predictive interframe encoding of the MPEG-2 
standard results in a balance between intraframe encoding 
and interframe encoding. 

[0008] The MPEG-2 standard exploits temporal redundancy 
for motion compensated interpolative and predictive 
encoding. That is, an assumption is made that "locally" the 
current picture can be modeled as a translation of the 
picture at a previous and/or future time. "Locally" implies 
that the amplitude and direction of the displacement are not 
the same everywhere in the picture. 
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[0009] The MPEG-2 standard further specifies predictive 
and interpolative interframe encoding and frequency domain 
intraframe encoding. It has block-based motion compensation 
for the reduction of temporal redundancy and discrete cosine 
transform based compression for the reduction of spatial 
redundancy. Under MPEG-2, motion compensation is achieved 
by predictive coding, interpolative coding, and variable 
length coded motion vectors. The information relative to 
motion is based on a 16x16 array of pixels and is 
* r t transmitted with the spatial information. It is compressed 

Q with variable length codes, such- as Huffman codes. 

-:A § 

s S 

^ [0010] The ISO MPEG-2 compression standard specifies only 

J! the syntax of bitstream and semantics of the decoding 

^ process. The choice of coding parameters and trade-offs in 

1=1 

fjj performance versus complexity are left to the encoder 

FU developers. 

Disclosure of the Invention 

[0011] In an MPEG-2 video compression system, it is 
desirable to achieve a highest possible video quality at a 
lowest possible compressed output bit rate. One method to 
help achieve this is to eliminate or substantially reduce 
any inherent "noise" in the incoming uncompressed video 
signal. Noise that is present in the input video signal can 
manifest itself as a distortion of the digitized luminance 
and chrominance input pixel values. This can create 
irregular patterns of digital values (i.e., an 
"unsmoothness" or "irregularity") in the input pixel data. 
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Irregular or noisy digitized pixel values make it difficult 
for an video encoder to perform optimal compression, 
especially at low bit rates. This is because MPEG 
compression, or any data compression scheme, depends on a 
high degree of correlation in the incoming data stream to 
establish redundancies (both spatially and temporally in the 
case of MPEG) . 

[0012] The more correlated the data, the greater the 
redundancies, and thus the higher the compression 
capability. The less correlated. the data, the less 
redundancies exist, and thus, the more difficult it is to 
compress. Eliminating noise in the incoming video signal 
creates a "smoothness" to the picture, as well as enhancing 
compression. This characteristic is especially significant 
when scaling the input video from standard full resolution, 
to a fraction of its original image size. A re-sized 
picture image will appear smoother and will compress better 
when noise filtering is performed during the scaling 
process . 

[0013] Briefly summarized, the present invention thus 
comprises in one aspect a method of filtering pixels of 
video frames of a plurality of video frames, e.g., for 
enhancing video encoding thereof. The method includes 
obtaining pixel values of video frames of a plurality of 
video frames; and programmably, spatially horizontally 
filtering the pixel values of the video frames. 
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[0014] In enhanced aspects, the horizontally filtering 
includes programmably noise filtering or noise filtering and 
image scaling the pixel values of the video frames. The 
horizontally filtering can further include obtaining filter 
coefficients for use in filtering, wherein the obtaining 
might include dynamically changing and/or switching filter 
coefficients during the filtering process. 

[0015] In another aspect, a method of processing pixels 
of video frames is provided. This method includes filtering 
pixel values of at least one video frame; and controlling 
the filtering so as to selectively perform spatial noise 
reduction or spatial noise reduction and image scaling on 
the pixel values of the at least one video frame. 

[0016] Systems and computer program products 
corresponding to the above-summarized methods are also 
described and claimed herein. 

[0017] To restate, provided herein is a technique for 
programmably horizontally filtering pixel values of frames 
of a sequence of motion video frames in real time. The 
technique is in part programmable since the coefficients 
employed in the filtering process are programmable and may 
be dynamically changed and/or switched by a user during the 
filtering process. For example, one or more sets of the 
coefficients could be changed or switched by a user on a per 
picture basis if desired. Such an embodiment would allow a 
user to improve picture quality dynamically using pre-loaded 
or user inputted sets of coefficients. 
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[0018] In one embodiment, an integrated horizontal 
filter/image scaler device is presented which is integrated 
into the front end of a video compression engine (e.g., 
within a digital video input interface thereof) . This 
filter/sealer is advantageous in that it can perform 
operations in "real time" (for example, 30 frames/sec for 
NTSC video standard input) . The device does not require 
large amounts of input pixel data buffering, since 
filtering/scaling can be performed on groups of input pixels 
"on the fly", as the video image. is scanned in horizontally, 
line by line. The filter/sealer device adds no extra 
latency to the input picture processing of the video 
encoding engine. 

[0019] Another advantage is that the filter/sealer device 
has an ability to "program in" filter coefficients from 
multiple filters when the encoding system is initially 
powered on. With multiple filters loaded and resident in 
the device, a user can then switch between filter and scaler 
options, for example, on input picture (frame) boundaries. 
This can be desirable since a user may want to account for 
scene changes in the video source, or to account for new 
video sequences in the source, e.g., caused by on the fly 
input video source switching. A further advantage to using 
a filter/sealer device as described herein is that new 
filter coefficients can be loaded dynamically (i.e., on the 
fly) while the filter system is busy filtering, with another 
filter set already in use. This new set of filter 
coefficients (i.e., the newly loaded coefficients) can then 
be switched in at an input frame boundary as noted above. 
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[0020] In addition, the horizontal filter mechanism 
disclosed herein is programmable to perform noise removal 
only, or noise removal in combination with horizontal image 
resizing. When employed for image resizing, multiple filter 
phases are applied to the groups of pixels being filtered. 

[0021] Additional features and advantages are realized 
through the techniques of the present invention. Other 
embodiments and aspects of the invention are described in 
detail herein and are considered- a part of the claimed 
invention. 

Brief Description of the Drawings 

[0022] The subject matter which is regarded as the 
invention is particularly pointed out and distinctly claimed 
in the claims at the conclusion of the specification. The 
foregoing and other objects, features, and advantages of the 
invention are apparent from the following detailed 
description taken in conjunction with the accompanying 
drawings in which: 

[0023 ] FIG. 1 shows a flow diagram of a generalized 

MPEG-2 compliant encoder 11, including a discrete 
cosine transformer 21, a quantizer 23, a variable 
length coder 25, an inverse quantizer 29, an 
inverse discrete cosine transformer 31, motion 
compensation 41, frame memory 42, and motion 
estimation 43. The data paths include the i th 
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picture input 111, difference data 112, motion 
vectors 113 (to motion compensation 41 and to 
variable length coder 25), the picture output 121, 
the feedback picture for motion estimation and 
compensation 131, and the motion compensated 
picture 101. This figure has the assumptions that 
the i th picture exists in frame memory or frame 
store 42 and that the i+l th is being encoded with 
motion estimation . 

FIG. 2 illustrates the I, P, and B pictures, 
examples of their display and transmission orders, 
and forward, and backward motion prediction. 

FIG. 3 illustrates the search from the motion 
estimation block in the current frame or picture 
to the best matching block in a subsequent or 
previous frame or picture. Elements 211 and 211 1 
represent the same location in both pictures. 

FIG. 4 illustrates the movement of blocks in 
accordance with the motion vectors from their 
position in a previous picture to a new picture, 
and the previous picture f s blocks adjusted after 
using motion vectors. 

FIG. 5 illustrates one embodiment of a pixel 
interface integrated within a digital video 
encoder, in accordance with an aspect of the 
present invention. 



[0028] FIG. 6 depicts in greater detail one 

embodiment of the pixel interface of FIG . 5, in 
accordance with an aspect of the present 
invention. 



[0029] FIG. 7 illustrates in greater detail one 

embodiment of the horizontal filter of FIG. 6, in 
accordance with an aspect of the present 
invention. 

[0030] FIG. 8 depicts one embodiment of the 

horizontal noise reduction filter of FIG. 7 , in 
accordance with an aspect of the present 
invention. 



[0031] FIG. 9 depicts one embodiment of the 

horizontal image scaler of FIG. 1 , in accordance 
with an aspect of the present invention. 

[0032] FIG. 10 illustrates one embodiment of a 

process for programmably horizontally filtering 
pixel values, in accordance with an aspect of the 
present invention. 



[0033] FIG. 11 illustrates one embodiment of a 

process for calculating luminance & chromanance 
horizontally filtered pixels and for performing 
image scaling, in accordance with an aspect of the 
present invention. 
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[0034] FIG. 12 depicts one embodiment of horizontal 

spatial noise reduction filtering for luminance 
data only, in accordance with an aspect of the 
present invention. 

[0035] FIG. 13 illustrates one embodiment of 

horizontal noise reduction filtering and image 
scaling for luminance data only, in accordance 
with an aspect of the present invention. 



[0036] FIGS. 14A & 14B illustrate another embodiment 

of a process for programmably horizontally 
filtering pixel values, in accordance with an 
aspect of the present invention. 



Best Mode for Carrying Out the Invention 



[0037] The invention relates, for example, to MPEG 
compliant encoders and encoding processes such as described 
in "Information Technology-Generic coding of moving pictures 
and associated- audio information: Video," Recommendation 
ITU-T H.262, ISO/IEC 13818-2, International Standard, 1996. 
The encoding functions performed by the encoder include data 
input, spatial compression, motion estimation, macroblock 
type generation, data reconstruction, entropy coding, and 
data output. Spatial compression includes discrete cosine 
transformation (DCT) , quantization, and entropy encoding. 
Temporal compression includes intensive reconstructive 
processing, such as inverse discrete cosine transformation, 
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inverse quantization, and motion compensation . Motion 
estimation and compensation are used for temporal 
compression functions. Spatial and temporal compression are 
repetitive functions with high computational requirements. 

[0038] More particularly the invention relates, for 
example, to a process for performing spatial and temporal 
compression including discrete cosine transformation, 
quantization, entropy encoding, motion estimation, motion 
H compensation, and prediction, and even more particularly to 

q a system for accomplishing spatial and temporal compression. 

El 

i. 

s s 

%l [0039] The first compression step is the elimination of 

*I spatial redundancy, for example, the elimination of spatial 

3 redundancy in an "I" frame picture. Spatial redundancy is 

ni th e redundancy within a picture. The MPEG-2 Standard uses a 

fy block based method of reducing spatial redundancy. The 

PI method of choice is the discrete cosine transformation, and 

fl! discrete cosine transform coding of the picture. Discrete 

cosine transform coding is combined with weighted scalar 
quantization and run length coding to achieve a desirable 
compression. 

[0040] The discrete cosine transformation is an 
orthogonal transformation. Orthogonal transformations, 
because they have a frequency domain interpretation, are 
filter bank oriented. The discrete cosine transformation is 
also localized. That is, the encoding process samples on an 
8x8 spatial window which is sufficient to compute 64 
transform coefficients or sub-bands. 



END920010078 



-12- 



[0041] Another advantage of the discrete cosine 
transformation is that fast encoding and decoding algorithms 
are available. Additionally, the sub-band decomposition of 
the discrete cosine transformation is sufficiently well 
behaved to allow effective use of psychovisual criteria. 

[0042] After transformation, many of the frequency 
coefficients are zero, especially the coefficients for high 
H spatial frequencies. These coefficients are organized into a 

p zig-zag or alternate-scanned pattern, and converted into 

|j run-amplitude (run-level) pairs. Each pair indicates the 

%| number of zero coefficients and the amplitude of the non- 

J|J zero coefficient. This is coded in a variable length code. 

Jjt [0043] Motion compensation is used to reduce or even 

[il eliminate redundancy between pictures. Motion compensation 

g exploits temporal redundancy by dividing the current picture 

fU into blocks, for example, macroblocks, and then searching in 

previously transmitted pictures for a nearby block with 
similar content. Only the difference between the current 
block pels and- the predicted block pels extracted from the 
reference picture is actually compressed for transmission 
and thereafter transmitted. 

[0044] One method of motion compensation and prediction 
is to record the luminance and chrominance, i.e., intensity 
and color, of every pixel in an "I" picture, then record 
changes of luminance and chrominance, i.e., intensity and 
color for every specific pixel in the subsequent picture. 
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However, this is uneconomical in transmission medium 
bandwidth, memory, processor capacity, and processing time 
because objects move between pictures, that is, pixel 
contents move from one location in one picture to a 
different location in a subsequent picture. A more advanced 
idea is to use a previous or subsequent picture to predict 
where a block of pixels will be in a subsequent or previous 
picture or pictures, for example, with motion vectors, and 
to write the result as "predicted pictures" or "P" pictures. 
More particularly, "this involves making a best estimate or 
prediction of where the pixels or macroblocks of pixels of 
the i fch picture will be in the i-l th or i+l th picture. It is 
one step further to use both subsequent and previous 
pictures to predict where a block of pixels will be in an 
intermediate or "B" picture. 

[0045] To be noted is that the picture encoding order and 
the picture transmission order do not necessarily match the 
picture display order. See FIG. 2. For I-P-B systems the 
input picture transmission order is different from the 
encoding order, and the input pictures must be temporarily 
stored until used for encoding. A buffer stores this input 
until it is used. 

[0046] For purposes of illustration, a generalized 
flowchart of MPEG compliant encoding is shown in FIG. 1. In 
the flowchart, the images of the i th picture and the i+l th 
picture are processed to generate motion vectors. The 
motion vectors predict where a macroblock of pixels will be 
in a prior and/or subsequent picture. The use of the motion 
vectors is a key aspect of temporal compression in the MPEG 
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standard. As shown in FIG . 1 the motion vectors, once 
generated, are used for the translation of the macroblocks 
of pixels, from the i th picture to the i+l th picture. 

[0047] As shown in FIG. 1, in the encoding process, the 
images of the i th picture and the i+l th picture are processed 
in the encoder 11 to generate motion vectors which are the 
form in which, for example, the i+l th and subsequent 
pictures are encoded and transmitted. An input image 111 of 
a subsequent picture goes to the motion estimation unit 43 
of the encoder. Motion vectors 113 are formed as the output 
of the motion estimation unit 43. These vectors are used by 
the motion compensation unit 41 to retrieve macroblock data 
from previous and/or future pictures, referred to as 
"reference" data, for output by this unit. One output of the 
motion compensation unit 41 is negatively summed with the 
output from the motion estimation unit 43 and goes to the 
input of the discrete cosine transformer 21. The output of 
the discrete cosine transformer 21 is quantized in a 
quantizer 23. The output of the quantizer 23 is split into 
two outputs, 121 and 131; one output 121 goes to a 
downstream element 25 for further compression and processing 
before transmission, such as to a run length encoder; the 
other output 131 goes through reconstruction of the encoded 
macroblock of pixels for storage in frame memory 42. In the 
encoder shown for purposes of illustration, this second 
output 131 goes through an inverse quantization 29 and an 
inverse discrete cosine transform 31 to return a lossy 
version of the difference macroblock. This data is summed 
with the output of the motion compensation unit 41 and 
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returns a lossy version of the original picture to the frame 
memory 42. 

[0048] As shown in FIG. 2, there are three types of 
pictures. There are "Intra pictures" or "I" pictures which 
are encoded and transmitted whole, and do not require motion 
vectors to be defined. These "I" pictures serve as a 
reference image for motion estimation. There are "Predicted 
pictures" or "P" pictures which are formed by motion vectors 
from a previous picture and can serve as a reference image 
for motion estimation for further pictures. Finally, there 
are "Bidirectional pictures" or "B" pictures which are 
formed using motion vectors from two other pictures, one 
past and one future, and can not serve as a reference image 
for motion estimation. Motion vectors are generated from 
"I" and "P" pictures, and are used to form "P" and "B" 
pictures . 

[0049] One method by which motion estimation is carried 
out, shown in FIG. 3, is by a search from a macroblock 211 
of an i th picture throughout a region of the next picture to 
find the best match macroblock 213. Translating the 
macroblocks in this way yields a pattern of macroblocks for 
the i+l th picture, as shown in FIG. 4. In this way the i th 
picture is changed a small amount, e.g., by motion vectors 
and difference data, to generate the i+l th picture. What is 
encoded are the motion vectors and difference data, and not 
the i+l th picture itself. Motion vectors translate position 
of an image from picture to picture, while difference data 
carries changes in chrominance, luminance, and saturation, 
that is, changes in shading and illumination. 
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[0050] Returning to FIG. 3, processing searches for a 
good match by starting from the same location in the i th 
picture as in the i+l th picture. A search window is created 
in the i th picture. We search for a best match within this 
search window. Once found, the best match motion vectors for 
the macroblock are coded. The coding of the best match 
macroblock includes a motion vector, that is, how many 
pixels in the y direction and how many pixels in the x 
direction is the best match displaced in the next picture. 
Also encoded is difference data, also referred to as the 
"prediction error", which is the difference in chrominance 
and luminance between the current macroblock and the best 
match reference macroblock. 

[0051] The operational functions of an MPEG-2 encoder are 
discussed in further detail in United States Letters Patent 
No. 6,118,823 by Carr et al . , entitled "Control Scheme For 
Shared-Use Dual-Port Predicted Error Array, " which is hereby 
incorporated herein by reference in its entirety. 

[0052] As noted above, horizontal filtering can be 
employed to soften pictures to an encode process, and 
thereby enhance data compression. A softened picture is one 
whose pixel values have been smoothed by such a function, 
allowing the encode process to proceed easier and the output 
to be more visually appealing. Further, a resized picture 
image will appear smoother and will compress more optimally 
when noise filtering is performed during the scaling 
process . 
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[0053] Presented below are certain horizontal filter 
implementations in accordance with aspects of the present 
invention. 

[0054] In one aspect, a horizontal filter is provided 
herein which employs both luminance coefficients and 
chrominance coefficients to dynamically perform horizontal 
noise reduction filtering, or horizontal noise reduction 
filtering in combination with image scaling. In one 
embodiment, this horizontal filter may be implemented on- 
chip (e.g., comprise hardware integrated onto a digital 
video encoder chip) . This integration eliminates any need 
for a user to perform external horizontal filtering and, in 
accordance with one embodiment of the invention, allows a 
user to improve picture quality dynamically by using and 
switching between pre-loaded sets of filter coefficients or 
dynamically inputted sets of luminance and chrominance 
filter coefficients, 

[0055] One embodiment of a video encode system, generally 
denoted 500, in accordance with an aspect of the present 
invention is depicted in FIG. 5. System 500 includes a 
video encoder 510 and an external frame store 520. Video 
encoder 510 includes pixel interface logic 530 for 
preprocessing inputted picture data, an input memory 540 and 
encoder function logic 550. In one embodiment, both pixel 
interface 530 and encoder function 550 output data to and 
receive data from frame store 520. The current macroblock 
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data fed into encoder function 550 is output from function 
550 as variable length coded data as discussed above. 

[0056] One embodiment of pixel interface 530 for video 
encoder 510 of system 500 is depicted in FIG. 6, wherein 
pixel interface 530 is shown to include a pre-processing 
unit 600. In addition to unit 600, the pixel interface 
might also include pixel input logic (not shown) for 
receiving raw pixel data for feeding to the pre-processing 
unit, and an output" buffer (not shown) disposed between the 
output of pre-processing unit 600 and, for example, input 
memory 540 of the encoder of FIG. 5. 

[0057] As shown in FIG. 6, preprocessing unit 600 
includes a horizontal filter 610, a vertical filter 620 and 
a temporal filter 630. Temporal filter 630 outputs filter 
data to and retrieves filter data from frame store 520. The 
retrieved filter data comprises the filtered output of a 
previous picture, and is used in temporal filtering the next 
picture. Described hereinbelow are various embodiments of a 
horizontal filter 610. Filters 620 & 630 are optional and 
are merely provided by way of illustration. An example of a 
vertical filter is provided in the above-incorporated patent 
application entitled "Programmable Vertical Filter For Video 
Encoding", by Ngai et al., while an example of a temporal 
filter is described in the above-incorporated patent 
application entitled "Programmable And Addaptive Temporal 
Filter For Video Encoding", by Murdock et al. 



END920010078 



-19- 



[0058] Note that depending upon the video source, a user 
may program or choose whether to horizontally filter, 
vertically filter, and/or temporally filter a given set of 
video frames. The horizontal filter and the vertical filter 
comprise spatial filters, while the temporal filter is a 
time-bas^d filtering between frames of a video sequence. 
Further, although depicted in FIG. 6 as integrated within a 
video encoder, those skilled in the art should recognize 
that the horizontal filter, vertical filter and temporal 
filter could collectively or individually comprise 
preprocessing logic disposed outside of the video encoder. 
That is, the horizontal filter described and claimed herein 
could reside within the video encoder or external to the 
video encoder as preprocessing logic. 

[0059] FIG. 7 depicts one embodiment of a horizontal 
filter 610 in accordance with an aspect of the present 
invention. Ho rizontal filter 610 includes horizontal noise 
reduction filter logic 700 and horizontal image scaler logic 
710. A horizontal filter coefficient buffer 720 holds 
multiple sets of filter coefficients, while read/write 
control 730 controls the reading of filter coefficients from 
buffer 720 and the writing of filter coefficients to buffer 
720. A programmable write path is provided to read/write 
control 730, buffer 720 and horizontal image scaler 710 for 
programmable control of the horizontal filter in accordance 
with certain aspects of the present invention. 

[0060] For example, the programmable write path signal, 
which can be generated by an external user or by internal 
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code, can control the horizontal filter for noise reduction 
filtering only of input pixels, or noise reduction filtering 
of input pixels combined with horizontal image scaling 
thereof, or even no filtering (in which case the non- 
filtered pixels are simply passed through filter 610} . 
Further, this programmable write path could be employed, for 
example, by a user to load different filter coefficients 
into buffer 720 for subsequent use by the noise reduction 
filter 700. In accordance with the present invention, 
filter coefficients for noise reduction could be adjusted on 
a per picture basis, if desired. - For example, should the 
video source change to a more noisy input, it might be 
desirable to dynamically switch the filter coefficients to 
increase the noise filtering. One embodiment of horizontal 
noise reduction filter 700 is depicted in FIG. 8, while an 
embodiment of horizontal image scaler 710 is depicted in 
FIG. 9. 

[0061] As shown in FIG. 8, the filter coefficients from 
buffer 720 are held in internal coefficient 

registers/controls logic 800 for use by the horizontal noise 
reduction filter 700. In accordance with an aspect of the 
present invention, these filter coefficients can be 
dynamically switched during the filtering process such that 
(if desired) a different set of filter coefficients could be 
employed with each new video frame undergoing noise 
filtering. 

[0062] The input pixel data is initially received in a 
pixel register and control logic 810. Assuming that 
filtering is enabled, a desired number of horizontal pixels, 
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for example, 8 in the case of luminance data, and 4 in the 
case of Cb and Cr chrominance data, are obtained and 
multiplied by a respective filter coefficient within 
multiply unit 820. If filtering is not enabled, then the 
input pixels are simply passed through filter 700 and output 
as non-filtered pixels. If filtering, the products from 
multiply unit 820 are summed by add unit 830, and divided 
by, for example, 256 in a divide/shift unit 840 . Bit 
truncation 850 is next performed to yield a final 8 bit 
pixel result. The result is placed in output buffer 860 for 
forwarding to horizontal image scaler 710 as filtered pixel 
output . 

[0063] FIG. 9 depicts one embodiment of a horizontal 
image scaler in accordance with the principles of the 
present invention. Image scaler 710 receives non-filtered 
pixels or noise reduced pixels from, for example, horizontal 
noise reduction filter 700, such as depicted in FIG. 8. If 
non-filtered pixels are received, then the pixels are 
forwarded directly to an output multiplexer/buffer 920 for 
output as non-filtered pixels. If noise filtering only, 
then the filtered pixels are passed directly to output 
multiplexer/buffer 920. Otherwise, when image scaling, the 
filtered pixels are passed through a decimation unit 900, 
which is controlled by control logic 910. Control logic 910 
receives input signals via the programmable write path. 
Output of decimation unit 900 is fed to output 
multiplexer/buffer 920 for forwarding to subsequent 
processing logic as described above. 
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[0064] Decimation unit 900 decimates the filtered pixels 
in accordance with programmed control signals. That is, the 
filtered pixels undergo image scaling in accordance with one 
of multiple possible resizing functions programmed into the 
image scaler, as explained further below. Decimation unit 
900 essentially drops filtered pixels passing through the 
image scaler in accordance with a predefined cadence as 
defined by controls 910. 

[0065] Additionally, in accordance with an aspect of the 
present invention, multiple phases are employed when noise 
filtering pixels to undergo image scaling. These phases, 
labeled "0-N" in FIG. 8, comprise different sets of filter 
coefficients, which are employed against the pixels of a 
group of spatially horizontal pixels being filtered. This 
use of filter phases, or different sets of filter 
coefficients, is also explained further below. 

[0066] FIG. 10 depicts one flowchart embodiment of a 
horizontal filter process 1000 in accordance with the 
present invention. Process 1000 assumes that luminance 
default coefficients and chrominance default coefficients 
are stored 1010, for example, in a horizontal filter 
coefficients buffer. Upon receipt of an input picture, 
processing determines whether horizontal filtering is 
enabled 1030. If "no", then the received pixel values are 
simply forwarded to subsequent logic, such as a vertical 
filter 1080. 
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[0067] Assuming that horizontal filtering is enabled, 
processing determines whether horizontal filter coefficients 
have been loaded by a user 1040. If "no", then default 
coefficients are employed/selected 1050. Otherwise, the 
user defined coefficients, e.g., loaded through a host 
interface, are used in the horizontal filter logic. The 
horizontally filtered luminance and chrominance pixel values 
are then calculated 1070, using either the user defined set 
of filter coefficients or a default set of filter 
coefficients. Once calculated, the filtered pixel values 
are forwarded, in one embodiment, to the vertical filter 
1080. 

[0068] Note that in one embodiment, the default filter 
coefficients could comprise multiple sets of default 
coefficients preloaded into registers as described herein. 
In such an example, a user system could select among the 
sets of filter coefficients (e.g., on a per picture basis), 
or alternatively, could load through the host interface 
customized luminance and chrominance filter coefficients for 
use in the horizontal filtering process. 

[0069] Further details of a horizontal noise filter/image 
scaler in accordance with aspects of the present invention 
are discussed below in connection with FIGs . 11-13. 

[0070] Referring to the filter/sealer processing 
embodiment of FIG. 11, as an input video image is scanned in 
horizontally, a group of 8 luminance pixels and four each Cb 
and Cr chrominance pixels is formed, i.e., in one example. 
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This data is held in an input pixel register 1110. Each 
pixel within the group, is then multiplied 1130 by a 
corresponding filter coefficient (or filter tap) programmed 
in (in one embodiment) at power on/initialization time. 
These filter coefficients are held in the internal 
coefficient registers of the horizontal noise reduction 
filter 1120. 

[0071] More particularly, the 8 luminance pixels in a 
group of pixels are multiplied by 8 luminance filter 
coefficients and the 4 Cb and 4 Cr chrominance pixels are 
each multiplied by the 4 chrominance filter coefficients. 
The luminance and chrominance filter coefficients are 
distinct, that is, there is one coefficient filter for 
luminance values, and another separate filter for 
chrominance values. In one embodiment, the filter 
coefficients are 9 bit signed values. 

[0072] The signed results of the multiplication process 
are then summed 1140, and divided by a normalized value 
1150. By way of example, the normalized value might 
comprise 256. .-Since 256 is assumed to be the normalized 
value, the sum of the programmable filter coefficients will 
be 256 as well. A bit truncation and saturation 1160 is 
performed to yield a final 8 bit pixel result; that is, a 
negative pixel value is capped as a zero value, while a 
value greater than 255 is capped at 255. These new filtered 
luminance and chrominance pixel values then represent the 
first luminance and chrominance pixel values in the line. 
The filtered pixel values are placed in an output buffer 
1170 of the noise reduction filter, as described above. 
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[0073] To generate the next filtered luminance and 
chrominance pixel values, a new group of 8/4 original 
luminance/chrominance pixel values is formed by shifting one 
original pixel value to the right. For example, original 
pixels 1-8 might form one group, while original pixels 2-9 
form a next group, and so on. 

[0074] In actuality, at the beginning of a line, certain 
pixels are repeated. For example reference FIG. 12, which 
depicts an example of noise reduction only for luminance 
components of the pixel values. In this example, the first 
groups of 8 luminance components comprise P1P1P1P1P2P3P4P5, 
P1P1P1P2P3P4P5P6, P1P1P2P3P4P5P6P7, P1P2P3P4P5P6P7P8 , 
P2P3P4P5P6P7P8P9, etc. Similarly, certain original pixels 
would be repeated at the end of a line to form the final 4 
original pixel groups at the end of the line. This will 
insure that the final number of filtered pixels matches the 
original horizontal size of the video image. 

[0075] After shifting to the right by one original pixel 
value, the entire process of multiplication, summing the 
results, dividing by 256, and truncation and saturation, is 
repeated. Thus, the next filtered luminance and chrominance 
pixel value is obtained. This process is repeated across an 
entire line to form a new horizontal line of filtered 
luminance and chrominance pixel values. It is these 
filtered values that are delivered downstream to the MPEG-2 
video encoding engine for compression. 



END920010078 



-26- 



[0076] As further shown in FIG. 11, in one embodiment, 
the filtered pixel values could optionally be fed through an 
image scaler for decimation 1180. For horizontal scaling of 
the filtered pixel values, the same process as described 
above is performed, except for the following differences. 

[0077] Each filtered luminance and chrominance pixel is 
created by applying separate "phases" of a programmable set 
of filter coefficients to each group of 8/4 original 
luminance/ chrominance pixels (again, assuming scaling is to 
be performed) . As used herein, a phase refers to a distinct 
and separate group of filter coefficients or taps. A filter 
used for scaling could include one or more phases, with each 
phase having 8 luminance filter coefficients and 4 
chrominance filter coefficients. For example, phase one of 
a filter might be applied against a first group of original 
luminance/ chrominance pixels, while phase two of a filter 
would be applied against a second group of original pixels, 
phase 3 applied to a third group, and so on, until the total 
number of phases in the filter has been cycled through. 

[0078] Once^this has occurred, one or more filtered 
pixels are "dropped", and the filter phase is cycled back to 
the beginning of the filter in use (i.e., back to phase 
one) . For example, for an image scaling option of 3/4 
resizing, the following operation occurs: phases 1, 2, and 3 
of the 3/4 filter are applied against pixel groups 1, 2, and 
3 of original pixels, respectively. Three filtered pixels 
are created from this process. Since only three phases are 
defined for a 3/4 filter, a fourth filtered pixel is then 
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created, but "dropped" by the image scaling process. The 
phase then cycles back to phase one. At this point, a new 
group of three filtered pixels is created, with a forth 
filtered pixel being dropped as before. The pattern repeats 
itself to the end of the line. 

[0079] As a specific illustration, a noise filter/sealer 
device in accordance with an aspect of the present invention 
might be programmed to provide multiple single phase noise 
filter only options' (e.g., 6 or more) and multiple scaling 
options (e.g., 6 or more). (Alternatively, the first phase 
of each image scaler filter option could be used as a noise 
reduction only option.) As one example, the scaling options 
might be as follows: 

[0080] Option 1: 1/2 resizing which comprises a 



phase filter, with one out of every three filtered 
pixels being dropped. FIG. 13 depicts an example 
for luminance data only of a 2/3 horizontal image 
scaler, wherein every third filtered pixel is 
dropped from the output. 



single phase filter with every other filtered 
pixel being dropped. 



[0081] 



Option 2: 2/3 resizing which includes a 2 



[0082] 



Option 3: 3/4 resizing which includes a 3 
phase filter, with one out of every four filtered 
pixels being dropped. 
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[0083] 



Option 4: 8/9 resizing which includes an 8 
phase filter, with one out of every 9 filtered 
pixels being dropped. 



[0084] 



Option 5: 3/8 resizing which includes a 3 
phase filter, with 5 out of every 8 filtered 
pixels being dropped. 



[0085] 



Option 6: 9/16 resizing which includes a 9 
phase filter, with 7 out of every 16 filtered 
pixels being dropped. * 



[0086] 



For the above scaling options, a phase cadence 



counter may be employed, for example, as part of the 
internal control logic of the horizontal filter. The phase 
cadence counter would indicate when filtered pixels are to 
be dropped. As in the case of noise filtering only, when 
performing image scaling of filtered pixels that are not to 
be dropped, the filtered pixels are output 1190 (FIG. 11), 
for example, to an MPEG-2 encoding engine for compression. 

[0087] FIGS': 14A & 14B depict another embodiment of a 
process for programmably horizontally filtering pixel 
values, in accordance with aspects of the present invention 
This process, generally denoted 1400, begins with 
initialization 1405 of the video encode system. Default 
filter coefficients are loaded 1410, for example, by 
internal microcode. A horizontal filter options register i 
then set by a user 1415, and processing inquires whether 
noise filter/image scaler processing is enabled 1420. 
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[0088] As used herein, the horizontal filter options 
register can be used to enable filtering, and to select 
whether to perform noise reduction only or noise reduction 
and image scaling. If the filter is enabled with a write t 
this register, and the user does not load custom filter 
coefficients into the horizontal filter coefficients buffer 
then the filter logic uses default coefficients. The filte 
options register could employ a 4 bit decode to 
enable/disable horizontal filtering, to select among 
multiple noise reduction only filters, and to select among 
different scaling options, as discussed above. 

[0089] From inquiry 1420, if "no", then the noise 
filtering/image scaling processes are bypassed 1425. 

[0090] Assuming that noise filtering or noise filtering 
and image scaling is enabled, then processing determines 
whether the user wishes to load custom filter coefficients 
1430. If "yes", then the custom filter coefficients are 
loaded into a coefficients buffer 1435. In one embodiment, 
a horizontal filter load/write control register could be 
employed in this process, along with a custom filter 
coefficients register. The control register could be used 
to select a "filter id", as well as a "phase id" for the 
particular filter selected. The custom filter coefficients 
register would contain the custom coefficient values to be 
loaded (e.g., into the horizontal filter coefficients 
buffer) following a write to of the horizontal filter 
load/write control register. Consecutive writes to this 
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register may be employed, with one write for each of the 
coefficients in the filter* Note further that when writing 
custom coefficients, the sum of all taps within a given 
phase should equal 256 since the original pixel values are 
assumed to comprise 8 bits, i.e., 0-255 in value. All 
phases for a given filter option should also be written. 
Note that noise reduction only options are one phase (i.e., 
phase 0 of a corresponding re-size filter) . 

[0091] Processing next determines whether noise reduction 
only is programmed 1440. If so, - then pixel value filtering 
is performed using a single phase of filter coefficients, 
with the output bypassing the decimation unit of the image 
scaler 1445. Otherwise, both pixel value filtering and 
image scaling is performed using multiple filter coefficient 
phases 1450 as discussed above. 

[0092] As shown in FIG. 14B, the pixel value filtering 
1445 or pixel value filtering and image scaling 1450 
continues for an entire video frame 1455. While processing 
a current video frame, process 1400 determines whether the 
user wishes to- modify the horizontal filter options register 
settings 1460. If "yes", then the horizontal filter options 
register is newly set by the user 1465. Process 1400 -also 
determines whether the user wishes to load new custom filter 
coefficients 1470. If "yes", then the custom filter 
coefficients are loaded into a coefficients buffer by the 
user 1475. Again, the setting of the horizontal filter 
options register, and the loading of custom filter 
coefficients occur simultaneous to processing of a current 
video frame. Upon reaching a frame boundary, the new 
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horizontal filter options and/or custom filter coefficient 
can be switched in for use by the horizontal filter. A next 
input frame 1480 is obtained, and processing loops back to 
inquiry 1420 to repeat the process for the pixel values of 
this next input frame. 

[0093] As noted by the above examples, the filter/image 
scaler presented herein is programmable dynamically in a 
number of ways. For example, a user can program in custom 
filter coefficients for use by the noise reduction 
filter/image scaler, a user can select whether to perform 
noise reduction filtering/image scaling (e.g., a user can 
select whether to perform only noise reduction filtering, or 
both noise reduction filtering and image scaling) , and a 
user can dynamically change filter options and coefficients. 

[0094] To summarize, those skilled in the art will note 
that presented herein are various embodiments of a 
horizontal filter for filtering/scaling video pixels that 
are horizontally scanned in and are spatially adjacent to 
one another. The noise filter/image scaler presented is a 
multi-tap, multi-phase filter (i.e./ has multiple phases 
applied for image scaling) . This noise filter/image scaler 
device can be integrated into front end logic of a video 
compression engine, and does not require its own separate 
ASIC device. The filter/sealer is advantageous in that it 
performs operations in real time (e.g., 30 frames/sec for 
NTSC video std) . The device does not require large amount 
of input pixel data buffering since the filtering/scaling 
can be performed on groups of 8 or 4 input pixels "on the 
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fly", as the video images are scanned in horizontally, line 
by line. The filter/sealer device adds no extra latency to 
the input pixel processing. 

[0095] Further, the filter/sealer presented herein has 
the ability to program in filter coefficients for multiple 
filters (for example, when initially powered on) or through 
a user interface, can accept user inputted filter 
coefficients. With multiple filters loaded and resident on 
the device, a user can switch between filter coefficients 
and scaler options on the fly, for example, at input picture 
boundaries. This is desirable since the user may want to 
account for scene changes in the video source or account for 
new video sequences in that source, e.g., caused by on the 
fly input video source switching. Further, by using the 
filter/sealer presented herein, new filter coefficients may 
be loaded dynamically, on the fly, while the horizontal 
filtering of pixels continues. These newly loaded filter 
coefficients can then be switched in at a next input frame 
boundary as described above. 

[0096] The present invention can be included in an 
article of manufacture (e.g., one or more computer program 
products) having, for instance, computer usable media. The 
media has embodied therein, for instance, computer readable 
program code means for providing and facilitating the 
capabilities of the present invention. The article of 
manufacture can be included as a part of a computer system 
or sold separately. 
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[0097] Additionally, at least one program storage device 
readable by a machine, tangibly embodying at least one 
program of instructions executable by the machine to perform 
the capabilities of the present invention can be provided. 

[0098] The flow diagrams depicted herein are just 
examples* There may be many variations to these diagrams or 
the steps (or operations) described therein without 
departing from the spirit of the invention. For instance, 
the steps may be performed in a differing order, or steps 
may be added, deleted or modified. All of these variations 
are considered a part of the claimed invention. 

[0099] Although preferred embodiments have been depicted 

and described in detail herein, it will be apparent to those 
skilled in the relevant art that various modifications, 
additions, substitutions and the like can be made without 
departing from the spirit of the invention and these are 
therefore considered to be within the scope of the invention 
as defined in the following claims. 
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